Correlation Clustering and Biclustering with Locally Bounded Errors
نویسندگان
چکیده
We consider a generalized version of the correlation clustering problem, defined as follows. Given a complete graph G whose edges are labeled with + or −, we wish to partition the graph into clusters while trying to avoid errors: + edges between clusters or − edges within clusters. Classically, one seeks to minimize the total number of such errors. We introduce a new framework that allows the objective to be a more general function of the number of errors at each vertex (for example, we may wish to minimize the number of errors at the worst vertex) and provide a rounding algorithm which converts “fractional clusterings” into discrete clusterings while causing only a constant-factor blowup in the number of errors at each vertex. This rounding algorithm yields constant-factor approximation algorithms for the discrete problem under a wide variety of objective functions.
منابع مشابه
Minimax Correlation Clustering and Biclustering: Bounding Errors Locally
We introduce a new agnostic clustering model, minimax correlation clustering, and a rounding algorithm tailored to the needs of this model. Given a graph whose edges are labeled with + or −, we wish to partition the graph into clusters while trying to avoid errors: + edges between clusters or − edges within clusters. Unlike classical correlation clustering, which seeks to minimize the total num...
متن کاملBi-correlation clustering algorithm for determining a set of co-regulated genes
MOTIVATION Biclustering has been emerged as a powerful tool for identification of a group of co-expressed genes under a subset of experimental conditions (measurements) present in a gene expression dataset. Several biclustering algorithms have been proposed till date. In this article, we address some of the important shortcomings of these existing biclustering algorithms and propose a new corre...
متن کاملBiclustering as a method for RNA local multiple sequence alignment
MOTIVATIONS Biclustering is a clustering method that simultaneously clusters both the domain and range of a relation. A challenge in multiple sequence alignment (MSA) is that the alignment of sequences is often intended to reveal groups of conserved functional subsequences. Simultaneously, the grouping of the sequences can impact the alignment; precisely the kind of dual situation biclustering ...
متن کاملGene Expression Data Knowledge Discovery using Global and Local Clustering
To understand complex biological systems, the research community has produced huge corpus of gene expression data. A large number of clustering approaches have been proposed for the analysis of gene expression data. However, extracting important biological knowledge is still harder. To address this task, clustering techniques are used. In this paper, hybrid Hierarchical k-Means algorithm is use...
متن کاملON LOCAL BOUNDEDNESS OF I-TOPOLOGICAL VECTOR SPACES
The notion of generalized locally bounded $I$-topological vectorspaces is introduced. Some of their important properties arestudied. The relationship between this kind of spaces and thelocally bounded $I$-topological vector spaces introduced by Wu andFang [Boundedness and locally bounded fuzzy topological vectorspaces, Fuzzy Math. 5 (4) (1985) 87$-$94] is discussed. Moreover, wealso use the fam...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016